
Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation



Abstract

Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and $N$-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg
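The abstract compares against $N$-step double Q-learning, one of the model-free baselines the generalized computation graph subsumes. As a rough illustration only (not the paper's actual model; the linear Q-functions, dimensions, and hyperparameters below are hypothetical stand-ins for the learned networks), the $N$-step double Q-learning target can be sketched as: the online network selects the bootstrap action, while the separate target network evaluates it.

```python
import numpy as np

rng = np.random.default_rng(0)
num_actions, obs_dim, gamma = 4, 8, 0.99  # hypothetical toy dimensions

# Hypothetical linear Q-functions standing in for the online and target networks.
W_online = rng.standard_normal((obs_dim, num_actions))
W_target = rng.standard_normal((obs_dim, num_actions))

def n_step_double_q_target(rewards, bootstrap_obs, done):
    """Compute an N-step double Q-learning target:
    the discounted sum of the N observed rewards, plus a bootstrapped value
    where the online network picks the action and the target network scores it.
    """
    g = sum(gamma**k * r for k, r in enumerate(rewards))
    if not done:
        a_star = int(np.argmax(bootstrap_obs @ W_online))              # selection
        g += gamma**len(rewards) * (bootstrap_obs @ W_target)[a_star]  # evaluation
    return g

# Usage: a 5-step segment ending in a non-terminal observation.
rewards = [1.0, 0.0, 0.5, 0.0, 1.0]
target = n_step_double_q_target(rewards, rng.standard_normal(obs_dim), done=False)
```

Decoupling action selection from evaluation is what distinguishes double Q-learning from vanilla Q-learning and reduces the overestimation bias of the max operator; setting $N=1$ recovers the single-step baseline also mentioned above.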
